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ABSTRACT . We study the relation between the trading behavior of agents 
and volatility in toy markets of adaptive inductively rational agents. We show 
that excess volatility, in such simplified markets, arises as a consequence of i) 
the neglect of market impact implicit in price taking behavior and of ii ) exces- 
sive reactivity of agents. These issues are dealt with in detail in the simple case 
without public information. We also derive, for the general case, the critical 
learning rate above which trading behavior leads to turbulent dynamics of the 
market. 

KEYWORDS: statistical mechanics. El Farol bar problem, minority games, 
market impact, adaptive learning. 



1. Introduction 



The El Farol bar problem (Arthur, 1994) and its later refinement, the Minority 



Game (MG) (Challet and Zhang, 1997), where devised to study the competitive 
interaction of boundedly rational agents in socio-economic contexts. In the El 
Farol bar problem agents face the binary choice of either attending or not a 



bar which is enjoyable only if it is not too crowded. Arthur (1994)| showed that 



inductive rationality is enough for the average attendance to self-organize around 
the comfort capacity of the bar. 

The El Farol bar context is simplified even further in the Minority Game. 
Again agents compete in forecasting what the majority will do, in order to be 
on the opposite side, which is the rewarding one. Agents, like speculators in 



© 2008 HERMES 



2 



M. Marsili and D. Challet 



a financial market ( Zhang, 199§| ), try to exploit predictability of the "market". 
This speculative pressure drives the system in a self-organized state where, on 
average, half of the agents take each of the two possible actions at each time. 

This confirms the naive expectation that speculative trading eliminates arbi- 
trages - i.e. predictable patterns - in markets. In addition, the MG allows to 
focus attention on the fiuctuations around the average and allows to address 
the question of whether and to which extent adaptive agents reduce or amplify 
fluctuations. Fluctuations in the MG are directly related to the profit losses and 
hence to wastes of resources. This is reminiscent of volatility in financial markets, 
which is an undesirable consequence of speculative trading. 

The MG describes in a nutshell the interplay between predictability and volatil- 
ity in a market with speculative trading. Since it allows to quantify both pre- 
dictability H and volatility (see below) , the MG allows to address questions of 
both information efficien cy (predictability) and ma rket stochasticity (volatility) . 
Much numerical work ( Challet and Zhang, 1997 ; Savit et at, 1999; Johnson et 



ai, 1998| ; [Cavagna, 199"9[]Challet and Marsih, 1999|; [Cavagna et a/., 1999]) as well 



as approximate analytical approaches ( Zhang, 1998 ; Challet and Zhang, 1998 ; 
Johnson et a/., 1999| ) have been devot ed to the understanding of the MC '^' . This 
has culminated in the exact solution (Challet et ai, 2000A; Marsili et ai, 2000) 
in the limit of infinitely many agents, thanks to statistical mechanics tools of 
disordered systems (Mezard et ai, 1987). The emerging picture is that the spec- 
ulative trading of agents actually minimizes market's predictability. As the num- 
ber of agents increases, predictability is reduced and beyond a critical number 
of agents, the market becomes unpredictable (to agents). At the same time also 
the volatility of the market decreases. Beyond the critical number, however, the 
market remains unpredictable but volatility starts increasing. In addition excess 
volatility due to crowd effects arise, which manifest in an anti-persistent behavior 
of the market ( |Savit et ai, 1999| ; phallet and Marsih, 1999| ). 

If the competitive aspects of the MG are related to predictability, from the 
point of view of volatility the MG is similar in nature to coordination games 
(Myerson, 1997). Indeed among all possible ways to exploit market's predictabil- 
ity, each agent has incentives to favor those which reduce market's volatility. 
Since the MG is a closed dynamical system, fluctuations inevitably depend on 
the behavior of agents. 

Here we study in detail the relation between volatility and the behavior of 
agents. There are two key aspects of agent behavior which have been found to 
dramatically affect volatility in the MG: 



1 The market impact and the extent to which agents account for it. In the 
original MG, agents neglect their market impact and behave as price tak- 
crsn. This behavior minimizes market's predictability but this leads to large 



t A collection of references on the Minority Game is available at the site 
http : //www.unif r . ch/econophysics/minority . 

■t Price taking behavior means that, when an agent considers counter-factually the results 
of the various choice he/she could have made, he/she does not account for the fact that the 
market "price" depends on the choice he/she actually made. If he/she had taken a different 
decision the "price" also would have changed. In few words, one may say that agents, rather 
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volatility increases (crowd effects) when the market is crowded. This ex- 
cess volatility is dramatically reduced as soon as agents start accounting, 
even approximately, for their impact. When agents behave strategically, 
properly accounting for their impact on the market, volatility is reduced to 
zero. 

2 Agents reactivity, i.e. how readily and strongly do agents react to the out- 
come of the game. Fast learning rates may dramatically enhance volatility 
in the MG, with respect to that with small learning rates. 



These results were found recently for the MG in its full complexity ([Challet 



et al., 2000 A; Marsili et al, 200C) and for a modified version of it (Cavagna 
et at, T999| )pi Here we derive these result in the simplest case where there is 



no public information. This case allows for a quite simple treatment where the 
result is not obscured by the technicalities needed to deal with the MG with 
information. 

In addition it allows to understand the origin of enhanced volatility for fast 
learning rates. This arises as a transition from a smooth dynamics to a more 
complex dynamical behavior which is reminiscent of chaotic or turbulent systems. 

Then we move to the MG with information and briefly review the results of 
Marsili et al. (2000) . In addition we derive a new result for the critical learning 
rate, as a function of a above which crowd effects arise in the MG. 

We close with some discussion of the results. 



2. The model without information 

The MG is a toy model of a market under the simplifying assumption that the 
market is a repeated game of N interacting agents. In real markets, agents can 
decide not to participate. In addition real agents trade at different frequencies. 
In the MG all agents have to trade at each time step. If ai{t) represents the 
investment of agent i at time t, his/her profit from this transaction is 

N 

u,{t) = aS)R[A{t)/Nl A{t) = ^ a,{t). (2.1) 

i=i 

The return R{A/N) depends on the aggregate investment A{t). This functional 
dependence embodies the market mechanism and it must be such that agents 
cannot extract a positive profit from just trading. Hence the game is a negative 
sum game: 



N 

E 

i=l 



u,{t) = A{t)R[A{t)/N] < 0, yt. (2.2) 



In addition we further assume that ai{t) can take only two values ±1. Hence 
agents choosing the minority sign a,; = —sign A gain a profit \R{A/N)\ whereas 

than playing against other rational agents, thinks as if they were playing against an exogenous 
price process. 

t Ichallet et al. (2000C) showed that the model introduced by ICavagna et al. (1999) has 
exactly the same collective behavior of the original MG in the limit JV — > oo. 



4 



M. Marsili and D. Challet 



the majority of agents loose an amount —\R{A/N)\. The simplest choice R{x) = 
—X will be made henceforth|^| 

Naively, each player faces the problem of anticipating what the majority will 
do, in order to do the opposite. This is similar in essence to the El Farol bar 



problem (Arthur, 1994) and to many other situations where players have to share 
a limited resource. 

The last ingredient of the MG is public information. Since the effects which 
we wish to focus on emerge even in the absence of public information, we shall 
introduce this ingredient later on. 

Let us generalize the model and allow agents to behave in a stochastic way. 
Hence we introduce the probabilities tt^ with which agent i plays action = +1 
(with Probjai — —1} = 1 — tt^). With tt,; = or 1 we recover the deterministic 
case, whereas tt^ ~ 1/2 means that agent i relies on the toss of a fair coin, to 
decide his/her action. Expected values over -ki are denoted by (...). 

For any state 7? = (tti, . . . , ttjv), we define 

N 

i/(7^) = (A)^ (A) = ^(27r, - 1) (2.3) 

1=1 

and 

JV 

a2(^)= =F + 4^7r,(l-7r,). (2.4) 

1=1 

H measures information efficiency. Indeed H > implies (A) ^ which means 
that one can predict the minority action —sign (A). When H = the outcome 
A is unpredictable. In both cases, cr^ is simply related to the total loss of agents 
cr^ = — iV^j(ui) and hence it measures optimality. Volatility can be defined as 
the excess fluctuations ~ H oi the market. 



3. Nash equilibria 

The key concept to analyze the outcome of strategic interactions are the Nash 



equilibria of game theory (Myerson, 1997). The MG is an n— players game, where 
each agent i may play pure strategies a,j = +1 or — 1 - or mixed strategies tt^ - 



and the payoff matrix is specified by Eq. (2.1). Nash equilibria (Myerson, 1997) 
are defined as those states tt in which agents have no payoff incentive to change 
their behavior. 

The purpose of this section is to provide a reference framework for our later 
discussion on adaptive agents. More precisely it is of interest to undestand under 
what conditions adaptive learning can lead to the outcome one would expect from 
rational players, i.e. to a Nash equilibrium. Hence we shall limit our discussion 
to a rather elementary levefl. 

Let us consider a state t?^-'^ where N~2j agents play mixed strategies tt.; — 1/2 

■t This is justified a posteriori by power expansion of R{x) in x because A(t) is typically of 
order \/lV. The original choice in ( |Challet.a,ud Zha.ng, 1997 lavit p.t al., 1 999 )_ lad R{x) = sign x 
which can also be dealt with similarly (tec appendix of Oliallet et al., 2UUUB). 

t In what follows, Nash equilibria are vectors tt such that no agent can increase his/her 
payoff by changing his/her mixes strategy ni, if others stick to theirs. 
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and 2j play pure strategies. The frozen agents playing pure strategies are divided 
into two groups of j agents playing opposite actions. This is a Nash equilibrium. 

In order to show this, we observe that the 2j agents playing pure strategies have 
no incentive to change it, because if they deviate from their strategy they suffer 
a loss. Agents playing mixed strategies must, on the other hand be indifferent 
between the two action, which occurs if {A — ai) = J^j^ii'^^j — 1) = 0. 

Overall, by simple combinatorics we find that there are 

Nash equilibria of this form (here [N/2] is the integer part of [Af/2]). For N even, 
the information efficiency and optimality of these equilibria are quantified by 

7J (t?*^') ) = 0, (tt*^') )^N - 2j. (3.2) 

This makes it clear that agents playing mixed strategies Trp"* = 1/2 are those 
who contribute to the value of cr^. Hence they are responsible for inefficiencies 
(excess volatility). As the number j oi frozen agent pairs increases the equilibrium 
becomes more and more efficient]^ and decreases. The equilibria t? where 
agents attain the minimal value of cr^, shall be called optimal henceforth. At the 
other extreme, the symmetric Nash equilibrium in which all agents play mixed 
strategies (ttI"' = 1/2 for all i) is the most inefficient. 

The number of frozen agents can increase if we allow for coordination devices 
such as binding agreements or public information. If agents i and k, playing 
(jTijiTk) = (1/2,1/2) coordinate their actions choosing to play either (ni^TTk) = 
(0,1) or (7ri,7rfe) = (1,0) with some probability, their payoffs increase (which 
increases the payoffs of other agents as well). By this, however, we are assuming 
that the two agents play in a correlated way|^|. Iteration of this argument suggests 
that agents will naturally choose one of the optimal equilibria tt^I^/^I-' if they can 
correlate their play. 

Summarizing, the single stage game has an huge number of Nash equilibria for 



^ 1 (Eq. 3.1). This raises the problem of equilibrium selection: Which of these 
equilibria describes the behavior of deductive rational agents? Unfortunately 
game theory, on the basis of rationality and common knowledge alone, does not 
provide an unambiguous answer to this question. 

Since Nash equilbria are very different in nature, with values of cr^ ranging 
from to A", this means that, in fact, we cannot draw any conclusion at all. 



Hence, as suggested by Arthur (1994), one has to abandon deductive rationality 



and replace game-theoretic players by inductively rational adaptive agents. 



t In game theoretical terms one may say that for any Nash equilibrium 7r(j' there is an 
equilibrium 7r(-'+^' which is Pareto dominant: In the latter each agent is not worse off than in 
the first and there is at least one agent which is strictly better off. 

■t We refer the reader to (Mycrson, 1997) for a detailed discussion of coordination games 
and correlated equilibria. 
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4. Adaptive agents 

Agents are adaptive, in the sense that they learn from past experience which 
choice is the best one. The past experience of agent i is stored in Ai{t) which 
is related to the cumulated payoff of the two possible actions. Ai{t) > means 
that the action = +1 is (perceived as) more successful than = — 1 and 
vice-versa. On the basis of their experience Ai{t) up to time t, the behavior of 
agents is described by 

1 ± m (t) 

Prob{a,(i) = ±1} ^ with m,{t) = x^[X{t)] (4.1) 

The function Xii^) is an increasing function of x with liTax^±oc Xi{^) = il- 
Hence the probability with which agent i plays action a.; = ±1 increases with 

The connection with mixed strategies is nii = 2ni — 1. Here we choose a 
different notation to stress the fact that rriiit) is not a mixed strategy, but rather 
it encodes a behavioral rule. Note that rm = {ai). 

The way in which {t) is updated is the last element of the learning dynamics. 
We assume that 

A,(i + 1) = A,{t) - l[A(t) - T^Mt)] (4-2) 

The idea is: if agent i observes A{t) < he/she will generally increase and 
hence his/her probability of playing = +1 at the next time step. The rji term 
above describe the fact that agent i may account for his/her own contribution 
to A{t). For j]i = 1 indeed, agent i considers only the behavior of other agents 
A(t) — ai(t) and does not react to his/her own action ai{t). 

The question of interest is: Do agents converge to a Nash equilibrium under 
this dynamics? If yes, to which equilibrium do agents converge? 

4.1. Naive agents r/^ — 

In the MG agents are naive and do not account for their impact on the aggre- 
gate, i.e. r]i = 0. 

Let us focus on the long time dynamics for N ^ 1. In particular we follow 
( Marsili et ai, 2000| ) and introduce a time variable r — t/N . A superscript ^'^^ 
denotes variables in the continuum time (e.g. A!f\T) = Ai(AT)). Then 

N{T + dT) 

A^\r + dr) = Al^\r)-- ^ A{t). (4.3) 

t = NT 

For N ^ 1 we can use the law of large numbers and conclude that the last term 
is well approximated by —dT{A)r. Here the subscript r implies that the average 
is taken over the probabilities [1 ± nif\T)]/2 of the actions ai = ±1, and we are 
assuming that Xi{^) is smooth enough (see later) so that these probabilities are 
approximately constant over the time interval [t, t + dr). Hence we find that 

N 

^t\r) = -{A)r = -Y.^t\r)- (4.4) 
1=1 
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It is quite easy to find that H is minimized along the trajectories of the learning 
dynamics with rji — for all i. 
Indeed, with the notation (. . .)r for averages taken at time t 

^ = =-2i7^xUAr^) (4.5) 

1=1 1=1 

where we have assumed that {xi{^i))T — Xi(Ai^'')i which is correct to leading 
order in 1/N. 

Since Xii^) ^ for aU x and i, we conclude that H < i.e. naive agents 
minimize the predictability H, and H—^Oast^oo. There are many states with 
H = and the dynamics select that which is "closer" to the initial condition. 
To be more precise, let Ai(0) be the initial condition (which encodes the prior 
beliefs of agents on which action is best). Then, as r — > oo, (A)^ ^Oand Af^(r) 
converges to 

POO 

Af^(oo) = A,(0) +M, with 6A= dT{A)r. (4.6) 

Jq 

The condition {A)^^ — provides an equation for SA 

N 

= Y,X^{MO) + SA). (4.7) 

i=l 

By the monotonicity property of Xi , this equation has one and only one solution. 

The asymptotic state of this dynamics is information-efficient {H = 0), but it 
is not optimal. Indeed, in general, this state is not a Nash equilibrium. Typically 
we find oc N. Only in the special case Ai(0) = and Xi(0) = for all i, we 
recover the symmetric Nash equilibrium jf^^^ — (1/2, . . . , 1/2) where = N. 



4.2. Less naive agents r]i > 



It is easy to check that with rji > 0, following the same steps as in the previous 
section, the learning dynamics of agents minimize the function 

N 

H, = {Af-Y,Tj,ml (4.8) 

1=1 

Because of the new term, 7?^ attains its minima on the boundary of the domain 
[— 1, 1]''^. In other words, nii = ±1 for all i which means that agents play pure 
strategies Oi = rui. The stable states are optimal Nash equilibria for N even. 
By playing pure strategies agents minimize the second term of iJ^. Of all corner 
states where m| = 1 for all i, agents select those with {A) = by dividing 
into two equal groups playing opposite actions. All these states have minimal 
"energy" — ~ rji. Which of these states is selected depends on the initial 
conditions Ai(0). 

Note that the set of stable states is disconnected. Each state has its basin of 
attraction in the space of Ai(0). The stable state changes discontinuously as 



Ai(0) is varied. This contrasts with the case rji = where Eq. (4.7) implies that 
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the stationary state changes continuously with Ai(0) and the set of stationary 
states is connected. 

For N odd, similar conclusions can be found. This can be understood by adding 
a further agent to a state with iV — 1 (even) agents in a Nash equilibrium. Then 
= (1 — rj[^)'m'j^, so for Ty^r < 1 the new agent will play a mixed strategy 
rrii = 0, whereas for yyjv > 1 it will play a pure strategy. In both cases other 
agents have no incentive to change their position. In this case we find cr^ < 1. 

Summarizing, when agents account for their impact on the aggregate, they at- 
tain not only an information- efficient state, but also a optimal Nash equilibrium 
with a"^ = H = 0. 

It is remarkable how the addition of the parameter rji radically changes the 
nature of the stationary state. Most strikingly, fluctuations are reduced by a 
factor N. 



4.3. Naive agents learning at a fast rate 



The difference between the behavior of naive and non naive agents becomes 
even stronger if agents are naive {rji = 0) and very reactive. In order to quantify 
more precisely what we mean, let us assume that|T| 

Xi(A) = tanh(rA), Vi. (4.9) 

Here F is the learning rate, which measures the scale of the reaction in agent's 
behavior (i.e. in nii) to a change in A^. We also assume that agents have no 
prior beliefs: Ai(0) = 0. Hence Ai{t) = y{t)/T is the same for all agents. From 
the results discussed above, we expect, in this case the system to converge to 
the symmetric Nash equilibrium m.^ — for all i. This is not going to be true if 
agents are too reactive, i.e. if F > F^,, as we shall see shortly. 
Indeed y{t) — FAi(t) satisfies the equation 

F ^ 

y{t + l) = y{t)--Y.a,{t) 

1=1 

~ y{t)~Tta.nh[y{t)] (4.10) 
where the approximation in the last equation relies on the law of large numbers 



for ^ 1. Eq. (4.10) is a dynamical system. The point y'' = is stationary, 
but it is easy to see that it is only stable for F < Fc — 2. For F > 2, a cycle of 
period 2 arises, as shown in figure 0. This has dramatic effects on the optimality 
of the system. Indeed, let zLy* be the two values taken by y{t) in this cycle^. 
Since y{t + 1) = —y{t) ~ zty* we still have (A) = and hence H = 0. On the 
other hand ~ N^y*'^ is of order N^, which is even worse than the symmetric 
Nash equilibrium where = N. 

This transition from a state where tr^ cx to a state with a'^ cx N'^ is generic 

t In this section we assume that agents follow a Logit model of discrete choice where the 
probability of choice a is proportional to the exponential of the "score" Ua of that choice; 
7r(a) oc e'"'^«/£^With only two choices a = ±1, Tr{a) = (1 + a m)/2 a.nd A = — U . 



recover Eq. (B.9b. This learning model has been introduced by Cavagna et al. (1999) 
context of the MG 



we 
in the 



■t ±y* are the two non-zero solutions of 2y = Ttaiih{y). 
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Figure 1: Graphical iteration of the map y{t) for F — 1.8 < Fc and F — 2.5 > Fc 



in the minority game, as first observed by Savit et al. (1999). Much work has 



been done by Johnson et al. to describe this effect quantitatively in terms of 
"crowds" and "anti-crows" . This approach is however static and does not reveal 



the dynamica l nature of the transition, which was first discussed in ( Challet and 
Marsih, 19991) . The transition from these two regimes as the learning rate F is 
changed has been first observed in ( |Cavagna et al., 1999 ) for the MG. 

We shall see below that the simple approach followed here can be generalized 
to the full minority game and it allows to derive the critical learning rate Fc(q;) 
as a function of the parameter a of the MG. 



5. The MG with information 

The same qualitative behavior occurs when agents have access to a public 
information represented by an integer variable /i ranging from 1 to P. fi may 



either be related to past outcomes of the game ( Challet and Zhang, 1997 ) or, as 
first suggested by Cavagna (1999) , it may be just randomly and uniformly drawn 
from {1,...,P}. The idea is that, agents resort to simple schemata or rules, 
which prescribe an action for each value of fi. Each agent is initially endowed 
with S such rules which are drawn at random among all possible 2^ binary 
functions. Agents take their choice on the basis of "scores" Ug^i which they 
assign to each strategy s, and they update scores in a way which is similar to 
Eq. (4.2). Savit et al. (1999) has shown that the relevant variable is a = P/N: 
Intensive (i.e. N independent) quantities, such as jN , display a behavior which 
does not depend on P and iV separately, but only on their ratio a. We refer the 
interested reader to (Marsili et al., 2000) for a discussion of the general case. 
Here we focus on the = 2 case, where following ( |Challet and Marsili, 1999 ; 
Challet et al, 2000 A[ ), we let s = ±1 be the label of the two possible rules of 



each agent and Si(t) be the choice actually taken by agent i at time t. Following 
the notation of ( phallet and Marsili, 1999 ; Challet et at, 2000A), the action 
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taken by agent i at time t, if he/she chooses to follow strategy Si{t) is ai{t) = 
tt'f'"*'' + Si(i)ff^*'', where ^{t) is the value taken by public information[|]. The 
decision process of agent i is hence encoded, as in Eq. (4.1) above, in the equation 



Prob{si(<) = ±1} = 



with mi{t) = Xi(Ai). 



(5.1) 



Here Ai{t) is the difference between the sco res of the two schemata, which are 
updated according to the analog of Eq. (4.2): 



A,(t + 1) = A,{t) 



me. 



M(t) 



mi, 



N 



(5.2) 



Refs. (Marsih et ai, 2000; Challet et al 



2000C) have shown that typical 
relaxation times in a system of N agents are of order N. In order to have a 
meaningful dynamics in the limit — > cx) it is necessary to introduce a rescaled 
time T = t/N. This leads to a deterministic continuum time dynamics ( [Challet 



and Marsih, 1999| ; [Challet et al, 2000 A| ) which is independent of N. The validity 
of this description relies on the fact that typically A(t) ^ a/Iv and hence |Ai(t + 
1) — Ai(t)| ^ is vanishingly small, which makes the continuum time 

approach exact as iV — s- oo. As we shall see, this fails to be true li a < and 
agents are too reactive, since then A{t) ^ N . 



Within the continuum time description one finds (Challet et al., 2000 A; Marsih 
et al., 200C) that with 77^ = for all i, the behavior of agents again minimizes 



predictability H. It can be shown (Marsili et al., 2000) that the payoff of each 
agent i increases with rji in the range rji € [0, 1]. In addition, also global efficiency 
increases. If P > acN , where ac = 0.3374 . . ., the improvement is smooth as a 
function of 77. If, on the other hand, P < a^N there is a sudden jump in cr^ as 
soon as agents switch on a small parameter rji — rj. This is shown clearly in figure 
Ij. This feature is capt ured qualitatively by the analytic calculatio n (in the replica 
symmetric ansatz) of ( Challet et al., 2000 A ; Marsili et al., 2000 ), whose result is 
shown in Figure g[. For a — P/N < ac, as the line 77 = is crossed from below, 
the system undergoes a first order phase transition with a discontinuous jump in 
cr^. Note that for F ^ Fc the jump in a'^ /N is of more than two decades! This 
discontinuity arises because the nature of the stationary state changes abruptly: 
For 77 < a finite fraction of agents play mixed strategies {mf < 1) whereas for 
77 > most of them play only pure strategies. For 7; = 1 all agents play pure 
strategies and indeed the system converges to a Nash equilibrium|t|. 

The transition for a > ac occurs at a value rjc^a) > and it is smooth 
(second order). A deeper discu ssion on the nature of this transition and on its 



consequences may be found in (Marsili et al., 2000; Dc Martino et al, 2000) 



6. Learning rate in the MG for a < ac 

As before, the performance of naive agents (77^ = Vi) may be much worse for 
a < ac a they are too reactive. This is shown numerically in figure |^. The effect 

t ui^ and ^Y^u are such that uj'^ it are two randomly choosen boolean functions of /i 
taking values in {±1}. 

t Here Nash eqnil ihria are defined with respect to the set of pure strategies = ±1 which 
agents can take. See [Marsili et al. (2000)[ for a detailed discussion. 
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Figure 2: ct^/A'' as a function of for 5 = 2 and A'' = 101. Diamonds refer 
to P = 8 (a ~ 0.079 < etc) with T = oo (open symbol) and T < Tc{a) (full 
symbol). Squares refer to P = 64 (a ~ 0.63 > ac) with F = oo (open symbol) 
and F «: Fc(a) (full symbol). 




Figure 3: Approximate theoretical estimate of global efficiency a"^ /N as a func- 
tion of a for 5 = 2 and several values of rj within the replica symmetric ansatz. 
The arrows mark the values of a to which Fig. 1 refers. 
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A(t,)/N 



Figure 4: Plot ofA{tk+i)/N vs A{tk)/N for the MG with N = 301 agents, T = oo 
and P — 16 {a — 0.053 . . .). The sequence tk is such that the information is the 
same (/i(ife+i) = ii{tk)). In the inset, the same pfot with real time t. 



is exactly the same as that discussed previously, in the absence of information 
(P = 1 or a ~ 0): As the learning rate V increases, the stationary solution A* 
looses its stability and a bifurcation to a complex dynamics occurs. This is only 
possible in the low a phase, where the stationary state is degenerate and the 
system can attain (A^) = (intended as a time average) by hopping between 
different states^ The plot of A{t+l)/N vs A{t)/N, in figure|shows that indeed 
wild fluctuations occur in one time step: a finite fraction of agents change their 
mind at each time step. This is what causes, for fixed P, the cross over, first 
observed in ( Savit et ai, 1999 ), from the linear regime u'^ to a quadratic 



dependence ^ 7V^. 

Clearly the continuum time limit, on which our analysis rests, breaks down 
because A{t) ^ N. Still one can compute the critical learning rate Tc{a) which 
marks the onset of complex dynamics. Let us focus attention on one value of 



IJ, — 1 and on the learning model of Eq. (4.9). Let us define the sequence of times 
tk such that /i(ife) = 1 for the fc'^ time. Fig. ^ shows that the dynamics of A in 
this modified time is not smooth. We define yi{k) = rAi{tk), which satisfies 

y,ik + l)^y,{k)-^ Amf^ (6.1) 

When iV 3> 1, the sum involves ^ P ^ aN ^ 1 terms and we may estimate 
it by the law of large numbers. Let y* be the stationary solution {yi(k + 1) = 
yi{k) = y*) of Eq. (p^l), then we can set yi{k) = yl + 5yi{k) and study the linear 



■t In the asymmetric phase a > ac the stationary state is unique and this effect is not 
possible. 
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Figure 5: Phase diagram of the MG {rji — 0) in the (a, F) plane. 



stabiUty of this solution. With the notation R — ^ W jP, we find 



<52/i(fc + 1) 



N 



(6.2) 



where 



tanh(y*). The solution y* is stable if the eigenvalues of T^j- are all 



smaller than 1 in absolute value. As F increases, the smallest eigenvalue of Ti^ 



becomes smaller that —1. Thanks to the results of Scngupta and Mitra (1997) 



we have an analytic expression for this eigenvalue, which is A_|_ = 1 — F(l 
^/af{l — Q)/2. The stabihty condition A+ > —1 then turns into 



F < Fc(a) = 



1 ^ 

1=1 



(6.3) 



which is our desired result. The function Q{a) is known exactly from the analytic 
solution|t| ( phallet et a/., 2000A| ; parsm et al, 20001 ). This yields a phase diagram 
in the (a,F) plane which is shown in figure |5| For a — > we find Fc —> 4^. 
As a ^ ttc, Fc converges to a finite value (~ 7.0273 . . .) with infinite slope. 
Numerical simulations suggests that, rather than a sharp transition, at Fc(a) 
the system undergoes a crossover between two distinct dynamical regimes. In the 
asymmetric phase (a > etc) the dynamics is always smooth, hence Tc{oi) = oo. 

The analysis of the stationary state for F ^ Fc and the calculation of global 
quantities such as is much more difficult than that in the smooth dynamical 



t Q{a) is given parametrically by Q = 1 — e ^ /(y^z) — (z^ — l/2)cri(z)/z^ and a = 
erf2(^)/[2z2(l + Q)]. 

t This differs from our previous result Fc = 2 without information, because with P = 1 in 
the MG half of the population has a+_i = a_^i two equal strategies. This reduces by a factor 
2 the effective number of adaptive agents, and accordingly Fc takes a factor 2. 
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phase (r <C Fc). Johnson et al. (1999) devised an approximate scheme, which ne- 
glects the dynamical aspect of the problem, but gives expressions for cr^ in good 
agreement with numerical data for a < Uc and T 3> Tc{a). A microscopic, sys- 
tematic derivation of cr^ is a challenging problem, and we believe the dynamical 
system (S.l) is the starting point. 



7. Conclusion 

In its simplicity, the MG captures and reproduces a great deal of mechanisms, 
aspects and properties of real markets. In particular it provides a deep under- 
standing of the relation between behavioral assumptions at the micro level and 
global macro behavior in a toy market. Here we have shown what properties of 
the behavior of agents are responsible for excess volatility in such toy markets. 
These are price taking behavior and excessive reactivity (fast learning). 

This leaves us with the question of whether and to what extent these con- 
clusions extend to real market contexts. Experimental studies in this direction 
promise to be very illuminating. 
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